OPEN 3 ACCESS Freely available online 



•0-PLOS I o-^E 



A DNA Sequence Recognition Loop on AP0BEC3A (jt\ 
Controls Substrate Specificity cros^rk 

Eric C. Logue^^ Nicolin Bloch^ ^ Erica Dhuey\ Ruonan Zhang\ Ping Cao^, Cecile Herate\ 
Use Chauveau\ Stevan R. Hubbard^'^, Nathaniel R. Landau^* 

1 Department of Microbiology, NYU School of Medicine, New York, New York, United States of America, 2 Kimmel Center for Biology and Medicine of the Skirball Institute, 
NYU School of Medicine, New York, New York, United States of America, 3 Department of Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, 
New York, United States of America 



Abstract 

AP0BEC3A {A3A), one of the seven-member AP0BEC3 family of cytidine deaminases, lacl<s strong antiviral activity against 
lentiviruses but is a potent inhibitor of adeno-associated virus and endogenous retroelements. In this report, we 
characterize the biochemical properties of mammalian cell-produced and catalytically active £ co//-produced ASA. The 
enzyme binds to single-stranded DNA with a of 150 nM and forms dimeric and monomeric fractions. ASA, unlike 
AP0BEC3G {A3G), deaminates DNA substrates nonprocessively. Using a panel of oligonucleotides that contained all possible 
trinucleotide contexts, we identified the preferred target sequence as TC (A/G). Based on a three-dimensional model of A3A, 
we identified a putative binding groove that contains residues with the potential to bind substrate DNA and to influence 
target sequence specificity. Taking advantage of the sequence similarity to the catalytic domain of A3G, we generated A3A/ 
A3G chimeric proteins and analyzed their target site preference. We identified a recognition loop that altered A3A sequence 
specificity, broadening its target sequence preference. IVlutation of amino acids in the predicted DNA binding groove 
prevented substrate binding, confirming the role of this groove in substrate binding. These findings shed light on how 
AP0BEC3 proteins bind their substrate and determine which sites to deaminate. 
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introduction 

APOBEC3A (ASA) is one of the seven-member APOBEC3 
family of cytidine deaminases encoded in the human genome [1- 
3]. The protein has potent cytidine deaminase activity but unlike 
other family members, such as APOBEC3F (ASF) and APO- 
BEC3G (A3G), does not appear to have considerable activity 
against HIV- 1 . A3F and A3G are packaged in the viral core as the 
virus assembles, most likely through interactions with the viral 
genomic RNA in a complex with the nucleocapsid protein of Gag 
[4—14]. Upon reverse transcription, the packaged enzymes 
deaminate the minus-strand of the viral reverse transcript, 
resulting in G^A mutation upon synthesis of the plus strand 
[15-19]. The lentivirus accessory protein Vif counteracts A3F and 
A3G by binding to and promoting their degradation [20] but fails 
to bind and induce A3A degradation. A3A can be packaged into 
HIV-1 virions but lacks antiviral activity toward HIV-1 and SIV 
[21]. Given ASA cytidine deaminase activity and its ability to be 
packaged, it is unclear why it lacks antiviral activity. Some reports 
suggest that it inefficiently packages or that it is packaged such that 
it cannot access the reverse transcription complex. Fusion of ASA 
to Vpr or the N-terminus of A3G activates the antiviral activity of 



the protein, perhaps by causing it to associate more tightly with the 
virion core [22,23]. 

Expression of ASA is largely restricted to myeloid cells such as 
monocyte-derived macrophages (MDM) and monocyte-derived 
dendritic cells (MDDC). The expression of ASA is upregulated in 
these cells in response to type I interferon treatment, which 
suggests a role for ASA in the antiviral response [24-26]. While 
ASA has little activity against lentiviruses, it potendy inhibits the 
parvoviruses, adeno-associated virus (AAV) and minute virus of 
mice (MVM) [2S]. It also has strong activity against human T- 
lympho tropic virus type I (HTLV-1) [27] and endogenous 
retroelements [21,28,29]. The mechanism by which ASA restricts 
AAV and retroelements is unclear. The inhibitory activity of ASA 
against both elements is dependent on amino acids in the catalytic 
active site yet no increase in the frequency of C^T or G^A 
mutations were detected. Moreover, a mutated ASA that lacks 
catalytic activity retains activity against AAV [28] . In addition to 
its antiviral role, ASA was reported to clear foreign DNA from the 
cytoplasm of monocytes in a cytidine deamrnation-dependent 
manner [26]. This function might explain how ASA restricts HIV- 
1 in macrophages even though it does not appear to mediate 
cytidine deamination of the HIV- 1 genome when packaged into 
the virus [24,25,S0,S1]. 
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ASA, like all APOBEC3 family members, specifically deami- 
nates single-stranded DNA (ssDNA). It consists of a single 
deaminase domain, a feature in common with APOBEC3C and 
APOBEC3H but different from APOBEC3B, APOBEC3D, A3F, 
and A3G wliich have two deaminase domains. When expressed in 
myeloid cells, ASA localizes in the cytoplasm and is not genotoxic 
[32]. However, upon ectopic expression, ASA localizes to both the 
cytoplasm and the nucleus and causes DNA damage [33]. This 
localization to both the cytoplasm and nucleus is in contrast to 
A3F and A3G, which are largely cytoplasmic. A3G acts as a dimer 
to processively deaminate ssDNA [34], while a recent study- 
suggests that ASA does not deaminate processively and functions 
as a monomer [35]. ASA has sequence homology to the carboxy- 
terminal domain of A3G [2] and mutagenesis studies identified 
several conserved amino acids that are critical for deaminase 
activity in both ASA and A3G [36]. The initial characterization of 
ASA identified (T/ C) C A as the preferred trinucleotide motif for 
deamination [21], which contrasts with the CCC trinucleotide 
preferred by ASG. How the sequence differences between ASA 
and ASG account for this altered sequence specificity has not yet 
been determined. 

Here, we biochemically characterize the enzymatic activity of 
catalytically active ASA produced in E. coli and in mammalian 
cells. We found that the protein is a dimer that acts nonproces- 
sively with a preference for a TC (A/G) trinucleotide. By 
generating chimeric proteins in which loops of ASA were 
exchanged with those of ASG, we identified a putative sequence 
recognition loop, which determines the deamination target site 
specificity. We also identified several residues in the putative 
ssDNA binding groove and a second putative recognition loop that 
were required for ASA interaction with ssDNA and identified two 
amino acids that are specific to ASA that appear to play a role in 
reducing the affinity of the protein for ssDNA. 

Results 

A3A Forms Multimers 

To biochemically characterize ASA, we generated highly 
purified recombinant ASA (rASA) in E. coli (Fig. lA). The rASA 
was expressed as a GST fusion protein, affinity purified using 
glutathione beads and then released from the matrix by cleavage 
with factor Xa. In the preparation of the protein, we found that its 
expression was highly toxic to the bacteria and that upon 
induction it induced G-^A mutations in the expression vector 
used to express it. We found, however, that minimizing the 
induction phase allowed us to produce large quantities of pure 
protein. The resulting protein was catalytically active since it 
retained its ability to deaminate oligonucleotide targets in a 
deaminase assay (Fig. IB). It also retained its ability to bind to 
ssDNA targets widi a calculated of 158 ±70 nM (Fig. IC). To 
determine the dimerization state of ASA, we subjected the rASA to 
size exclusion chromatography (SEC). Analysis of the fractions 
showed that rASA ran as both a monomer and a dimer in the 
buffer used for measuring deaminase activity (Fig. 2A). While the 
majority of rASA appears to be monomeric, there is a significant 
fraction that exists as a dimer. Both monomeric and dimeric 
fractions displayed high deaminase activity (Fig. 2B and Fig. SI). 

Since the production of ASA in bacteria or conditions used 
during the SEC might have led to the observation of artifactual 
multimers [37], we asked whether such multimers could be 
observed upon expression of ASA in mammalian cells. We 
therefore cotransfected 293T cells with expression vectors for HA 
and FLAG-tagged ASA. We then immunoprecipitated with anti- 
HA antibody and analyzed the proteins on an immunoblot probed 



with anti-FLAG antibody. This analysis showed that FLAG- 
tagged protein coimmunoprecipitated when expressed with A3A- 
HA, demonstrating that ASA forms ofigomers in the cell (Fig. 2C), 
although it does not distinguish dimers from higher order 
multimers. 

ABA Deamination is Nonprocessive 

ASG processively deaminates ssDNA. To determine whether 
ASA also deaminates processively, we established a deaminase 
assay using an oligonucleotide substrate containing a fluorescein 
tag flanked by two TCG trinucleotides (Fig. 3A). This assay 
allowed us to determine whether deamination occurred at the 5' 
deamination site, the 3' deamination site or at both sites. We 
found that rASA preferentially deaminated the 5' site (Fig. SB). 
Using these data, we calculated the processivity factor, a measure 
of whether double deamination occurs more frequently than that 
predicted by the deamination frequency at the two independent 
deamination sites. The calculated processivity factor was less than 
or equal to one at all time points tested, indicating that double 
deamination occurred at the frequency expected for the two 
independent sites. This suggests that ASA deaminates ssDNA 
using a nonprocessive mechanism (Fig. 3C). 

We further tested processivity by determining the ability of 
rASA to deaminate an oligonucleotide substrate containing 
deamination sites arrayed in tandem. Following this deamination 
reaction, the oligonucleotide substrate was subjected to one round 
of amplification, cloned into TOPO-TA vector and the resulting 
clones were serjuenced (Fig. 3D). Eight of the twenty sequenced 
clones had no deaminations (0 sequential deamination sites). The 
clones that had deamination sites showed littie evidence of 
sequential deamination since most of the deaminated sites lacked 
adjacent deaminated sites (Fig. SE). The few sites that did have 
sequential deamination sites generally had no more than four sites 
sequentially deaminated. These data further suggest that ASA 
utilizes a nonprocessive deamination mechanism. 

A Putative Recognition Loop Constrains ABA Sequence 
Specificity 

We used a panel of four oligonucleotides to determine the target 
site preference of ASA, each of which contained four tandem 
deamination sites. The four 5'-biotinylated oUgonucleotides varied 
in the nucleotide 5' of the target cytosine (N, different for each 
oligo), and each oligo contained four different deamination sites 
differing at the nucleotide 3' of the target cytosine (NCA, NCT, 
NCC, and NCG). Each oligonucleotide was incubated with rASA, 
and the products were separated on a gel and analyzed as in the 
in vitro deaminase assay. The analysis demonstrated that rA3A 
preferred a TC dinucleotide (Fig. 4A). Further deamination 
experiments using oligonucleotide substrates containing a single 
deamination site showed that TC followed by A or G was the 
preferred trinucleotide motif (Fig. 4B). Thus, the consensus target 
for ASA is TC (A/G). These results are in agreement with a 
previous study investigating the 5' context constraints of the 
deamination site [35] yet also describe specificity of ASA toward 
the S'-nucleotide context as well. 

The TC (A/G) consensus deamination site of ASA differs from 
the CCC consensus site of ASG. The catalytic domains of ASA 
and ASG are very similar in amino acid sequence and thus, 
relatively few amino acids are likely to determine this sequence 
preference. In order to test this hypothesis, we compared the 
sequences and structures of ASA and ASG (Fig. 5A and Fig. S2). 
To understand which amino acids might play a role in 
determining the sequence preference of the enzymes, we 
compared the NMR structure of ASA [38] to the crystal structure 
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Figure 1. rA3A produced in £ co// binds and deaminates ssDNA. A) rA3A was visualized by Coomassie staining B) The extent of deamination 
was determined for rA3A incubated with ssDNA for 2, 5, 10, 20, 40, or 60 minutes. C) DNA binding was determined by measuring the change in the 
intrinsic fluorescence of rA3A following incubation with an increasing amount of ssDNA. 
doi:1 0.1 371 /journal.pone.0097062.g001 
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Figure 2. ASA homodimerizes in vitro and in vivo. A) top. rA3A was run on a size exclusion column, bottom. A3A content was tested for each 
fraction by immunoblot using the anti-A3A antibody. B) The overall protein content of each fraction was determined by Bradford assay, and catalytic 
activity was determined by in vitro deaminase assay with a biotinylated TCA oligo using an equal volume of each fraction. C) 293T cells were 
transfected with expression vectors for HA, FLAG-tagged A3A or both. The cells were lysed, and the HA-tagged A3A was immunoprecipitated. The 
immunoprecipitates were then analyzed on an immunoblot. 
doi:1 0.1 371/journal.pone.0097062.g002 
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Sequential TCA sites deaminated 



Figure 3. ABA deamination is nonprocessive. A) An oligonucleotide containing a fluorescein tag flanked by consensus deamination sites was 
incubated with rA3A. B) The oligonucleotide was incubated with rA3A for 2, 5, 10 and 20 minutes, and single or double deamination at the 5'- and 3'- 
target sites was determined. C) The processivity factor was calculated for each time point. D) An oligonucleotide containing a series of target sites 
was incubated with rA3A and deamination states were identified by DNA sequencing. E) The frequency of deaminations at sequential target sites 
were quantified. Eight of the sequenced oligonuleotides contained no mutations and are indicated as having 0 sequential deamination sites. 
doi:1 0.1 371 /journal.pone.0097062.g003 



of A3G [39]. The comparison showed diflFerences in the two loop 
regions located adjacent to the likely substrate-binding groove. 
These could be substrate recognition loops. In addition, ASA has a 
tryptophan and glycine residue (WG 104-5) that is absent from 
A3G. The ASA NMR structure shows that the side-chain of W 104 
points into the substrate pocket and thus might influence substrate 
specificity (Fig. 5A and Fig. S2). 

To determine whether these diflFerences account for the 
substrate specificities of ASA and ASG, we generated chimeric 
proteins in which recognition loop 1 (RLl) and recognition loop 2 
(RL2) in ASA were replaced by the corresponding recognition 
loops of ASG. In addition, we generated ASA with a W104A 
mutation or a deletion of W 104 and G105 (AWG 104-5). To 
determine the catalytic activity of the mutated enzymes, we 
expressed them in transfected 29ST cells, immunoprecipitated 
them, and tested the catalytic activity of the proteins in an in vitro 
deaminase assay. The results showed that the RL2 ASA and 
W104A mutants of ASA were inactive. In contrast, RLl and 
AWG 104-5 ASA retained activity against a TCA oligonucleotide 
(Fig. 5B and 5C). There was the possibility that the mutated 
proteins changed their target sequence preference. We tested the 
RLl and AWG 104-5 ASA for the ability to deaminate oligonu- 
cleotides containing every combination of trinucleotides. These 
results showed that the mammalian cell-expressed wild-type ASA 
had the same site preference as the E. coli-produced enzyme. 
AWG 104-5 ASA maintained the preference for TC (A/G). In 
contrast, the RLl chimera was active, but was more flexible with 
respect to the nucleotide 5' to the deaminated C, deaminating AC, 



CC and GC with increased frequency (Fig. 5D and 5E). These 
data suggest that the presence of GI25-26 in RLl of ASA specifies 
the 5' nucleotide while swapping the EPWA^R residues from RLl 
of ASG relieves this constraint. 

Mutational Analysis Confirms the Role of the Putative 
DNA Binding Site on ASA 

The crystal structure of ASG shows a shallow groove in ASG 
that surrounds the catalytic core and is postulated to be the 
binding site for ssDNA [40]. A potential DNA-binding groove is 
also apparent in the NMR structure. We identified residues H70, 
W98, R128 and YISO at the base of the groove as having the 
potential to interact with a DNA substrate (Fig. 6A). To test 
whether these amino acids are required for DNA binding, we 
generated alanine point mutations and tested their catalytic 
activity in the in vitro deaminase assay (Fig. 6B). The results 
showed that each of these residues was required for deaminase 
activity. 

Because these residues surround the catalytic core, it is possible 
that the loss of deaminase activity was due to alterations in 
catalytic function rather than an inability to bind DNA. To 
determine the role of these amino acids in ssDNA binding, we 
expressed the mutated proteins and purified them from E. coli. To 
increase the yield of these recombinant proteins, we generated 
each mutated protein in an E72A background. The E72A 
mutation inactivates the catalytic activity of the enzyme, allowing 
for increased production in E. coli without affecting ssDNA 
binding. We then measured the DNA binding using two different 
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Figure 4. The preferred consensus deamination target site of ABA is TC (A/G). A) rA3A was incubated with ssDNA substrates containing 
four different target site sequences. B) rA3A was incubated witli ssDNA substrates containing different target site sequences. 
doi:1 0.1 371 /journal.pone.0097062.g004 



assays. In the first assay, binding was determined using a gel 
retardation assay wliile the second assay was a fluorescence assay. 
In the gel retardation assay E72A, E72A/W104A, E72A/ 
AWG104-5 and the E72A/RL1 chimera bound ssDNA. Interest- 
ingly, mutation of the tryptophan residues appeared to enhance 
ssDNA binding, suggesting that the WG 104-5 insertion in ASA 
decreases substrate-binding affinity. The RL2 chimera and 
mutations of the four binding groove residues were inactive in 
the binding assay (Fig. 6C). Thus, the amino acid residues in the 
groove regulate DNA binding to ASA. 

For the fluorescence-binding assay, an oligonucleotide contain- 
ing a fluorescein-modified nucleotide was incubated with increas- 
ing amounts of rASA. Fluorescence intensity measured by 
spectroscopy allowed us to generate a binding curve to determine 
the K(j of the interaction. The analysis showed that the E72A and 
the E72A/RL1 ASA bound to ssDNA with a similar to that of 
wild-type. The E72A/WG104A and E72A/AWG104-5 proteins 
bound DNA with higher alTmity than wild-type, consistent with 
the gel retardation assay results. The E72A/RL2 chimera and 
proteins containing point mutants in the groove lacked measurable 
ssDNA aflinity (Fig. 6D). 

ASA is Induced by Type-I and Type-ll IFN 

Despite its potent cytidine deaminase activity and its ability to 
inhibit AAV, whether ASA has antiviral functions in vivo is not 
clear. One common feature of host proteins involved in the 
antiviral response is that they can be induced by type-I IFN or 
type-II IFN. To determine whether ASA is induced by either IFN, 
we utilized a polyclonal rabbit antiserum raised against the rASA, 
which showed high specificity for ASA (Fig. SS). We treated 
primary human monocytes, MDM, MDDC and CD4^ T cells 



with type-I IFN or type-II IFN and detected the proteins on an 
immunoblot probed with this rabbit antiserum. This experiment 
confirmed the findings from Koning et al. [24] and Peng et al. [25] 
that type-I IFN treatment upregulates ASA protein levels in MDM 
and monocytes (Fig. 7). We extended these findings by demon- 
strating that type-I IFN treatment also slightly upregulated ASA 
protein levels in MDDC and that type-II IFN increased ASA 
protein levels in MDM. We also demonstrated that this expression 
is cell-type specific since neither type-I IFN nor type-II IFN 
treatment of CD4+ T cells upregulated ASA to a detectable level. 
These fmdings are consistent with a role for ASA in anti-viral 
responses in myeloid cells. 

Discussion 

A detailed analysis of deamination target site preference shows 
that the consensus deamination sequence for ASA is TC (A/G). 
This finding is similar to the (T/C) CA consensus previously 
reported by Chen et al. [41] and the TC identified by others 
[35,42]. We find that the ASA preferentially targets a S' purine. 
This target site preference distinguishes ASA from A3G, which 
targets CCC. The amino acid sequence similarity of the catalytic 
domains of ASA and ASG allowed us to generate chimeric 
proteins to determine the domains of the protein that specify the 
target site preference. The analysis identified a recognition loop 
termed RLl in the amino-terminal domain of ASA. Replacement 
of RLl with the corresponding sequence of ASG did not exchange 
the target site preference to CCC but allowed for deamination 
irrespective of the 5' and 3' nucleotides. Unlike ASG, ASA 
deaminates ssDNA substrates by a nonprocessive mechanism, 
consistent with the findings of Love et al. [S5]. 
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Figure 5. The target site preference of ABA is influenced by RL1 . A) A model of ASA on the A3G crystal structure shows loops RL1 and RL2 (in 

red) flanking the proposed DNA binding groove and shows the inserted the WG amino acids (in orange) close to the conserved catalytic E72 (in 
green). IVlammalian expression vectors for ASA in which RL1 or RL2 of A3G was swapped into ASA (RL1 and RL2) and a mutated ASA deleted for the 
WG were generated. B) The mutated proteins were expressed in transfected 29ST cells, immunoprecipitated from cell lysates and then tested for 
deaminase activity against an oligonucleotide containing a TCG target sequence. C) The deamination activity of the mutated proteins was expressed 
as the ratio of the intensity of the deaminated product to the sum of the intensities of the unmodified and deaminated species. D) The target site 
specificity of the immunoprecipitated proteins was determined by incubating with oligonucleotides containing an NCN target site. The relative 
percentage deamination was determined as the percent deamination of the NCN oligonucleotide normalized to the TCA oligonucleotide. E) The 
target site specificity was determined as the ratio of deaminase activity determined for the NCs oligonucleotides to the total deamination activity. 
doi:10.1S71/journal.pone.0097062.g005 



The mechanism by which RL 1 determines target site specificity 
of A3A is not clear. One possibility is that residues G25 and 126 in 
RLl directly contact the DNA substrate; however, the NMR 
structure reported by Byeon et al. [38] did not show an interaction 
between a ssDNA substrate and residues of this loop. Alternatively, 
the chimera may extend RLl by three amino acids, allowing more 
flexibility within the substrate-binding groove and less target site 
constraint. This would not explain how the chimeric protein site 
preference is affected at both the 5' and 3' flanking nucleotides. 
Deletion of three amino acids in the RL 1 of A3 A in the human 
lineage was recently proposed to impair its antiviral activity against 
HIV-1 enveloped SIV [43]. The RLl chimera, which has an 
insertion of three amino acids, can be packaged in HIV- 1 virions 
but is not antiviral (data not shown), suggesting that RLl is not 
solely responsible for the lack of antiviral activity of ASA against 
HIV-1. 

Kohli et al. [44] have found that swapping of the recognition 
loop corresponding to RL2 in AID to that of A3G switched the 
target site preference. We found that swapping RL2 of A3G onto 
ASA caused the loss of deaminase activity, consistent with a similar 
swap reported by Narvaiza et al. [28]. The chimeric protein failed 
to bind ssDNA, accounting for its loss of catalytic activity. This 
fmding is consistent with the NMR structure, which shows that this 
loop contacts the ssDNA substrate. Thus, RL2 affects target site 
preference but its function is influenced by other sequences in the 
enzyme. 



Given the proximity of amino acids WG 104-5 to the active site 
cysteine-coordinated 7,v?^, it is surprising that insertion of these 
amino acids are tolerated yet deletion has no effect. In the 
evolution of these proteins, the insertion of these amino acid 
residues into ASA in new world monkeys and hominids must have 
occurred following the divergence of ASA and A3G. The deletion 
of the two amino acids caused a three-fold increase in afiinity of 
the protein for ssDNA resulting in an aflinity similar to that of 
ASG for its substrate. Mitra et al. [45] recently reported that a 
W104A mutation maintained catalytic activity at a reduced level 
while a G 1 05 A mutation resulted in a complete loss of catalytic 
activity. It is hard to understand how a conservative mutation at 
G105, where the mutated residue differs only in the presence of a 
methyl group, results in a complete loss of activity while a non- 
conservative mutation at the adjacent W104 residue did not. In 
light of the results of Mitra et al., we did an additional kinetic 
analysis to definitively determine the deaminase activity of the 
W104A ASA. The results again confirmed our finding that the 
W104A, RL2 and E72Ci rnutants lacked detectable deaminase 
activity (Fig. S4). Interestingly, the W104A mutant also had an 
increased affinity for ssDNA. This suggests that W104 plays an 
important role in deamination, perhaps by ensuring the proper 
positioning of the ssDNA substrate. This would be consistent with 
the ASA NMR model that indicates that W104 contacts the 
ssDNA substrate [38]. However, this contact must only be 
necessary when the adjacent G105 residue is present, as the 
ASG-like AWG 104-5 mutant retains deaminase activity. 
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Figure 6. Residues in the putative DNA binding cleft determine binding to ssDNA. A) The ABA model predicts four conserved amino acids 
that could play a role in DNA binding. Each was mutated to alanine. B) The mutated ABA proteins were expressed in transfected 293T cells, 
immunoprecipitated from cell lysates and tested for deaminase activity against a deoxyoligonucleotide DNA containing a TCG target site. C) £ coli 
expressed mutated ABA proteins in the background of the E72A catalytic site mutation were purified and similar amounts of each were analyzed for 
ssDNA binding in the gel retardation assay. D) A fluorescein tagged oligonucleotide was incubated with increasing amounts rABA from (C). The 
change in the extrinsic fluorescence at each protein concentration was used to fit a one-site saturation ligand-binding curve. The calculated Kd for 
each curve is displayed on a table together with the standard error. ND indicates that a could not be determined. 
doi:1 0.1 B71 /journal.pone.0097062.g006 



Surprisingly, the WG motif in the ASA of hominids has been 
replaced by a single tryptophan in Grivets, an arginine in Rhesus 
macaques and Colobus monkeys and deleted in new world 
monkeys. This motif was recently identified as being part of the 
ssDNA binding interface [38]. The species differences in the 
sequence of this loop of ASA suggests that this domain of the 
protein is under selective pressure. The amino acid changes could 
allow the protein to be less processive, thereby facilitating 
deamination over a larger span of viral genomes. Mutation of 



the tryptophan to alanine disrupted ASA catalytic activity but not 
ssDNA binding suggesting that the residue is involved in 
positioning of the ssDNA for deamination. In addition, decreased 
ssDNA binding affinity of ASA through mediated by alterations of 
the WG motif might help to prevent deamination of the cell's 
genomic DNA, a problem that may be greater for ASA, which 
localizes to the nucleus, than other APOBECS proteins which are 
cytoplasmic. The low ssDNA binding affinity of ASA may account 
for its failure to deaminate retroviral DNA by causing it to localize 
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Figure 7. Type-! and type-ll IFN induce A3A in monocytes, iVIDM and MDDC. Human monocytes, MDM, MDDC and activated CD4+ T cells 
were treated with IFN-p or IFN-y. After 24 h the cells were lysed and protein levels determined by immunoblot using a rabbit antiserum specific for 
ABA. Similar results were obtained with cells purified from the blood of three independent donors. 
doi:1 0.1 B71/journal.pone.0097062.g007 
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outside of the viral core in virions [22,23] . Lastly, the decreased 
affinity may provide a more dynamic interaction with ssDNA to 
allow it to deaminate incoming viral DNA, a feature that is unique 
to ASA among the APOBEC3 family members [30,31]. 

The crystal structure of A3G has a groove that runs across the 
catalytic core of the protein that is thought to bind ssDNA [40] . 
Our analysis suggests a similar groove in ASA, as amino acid 
residues at the base of this structure affected ssDNA binding. 
Amino acids within the groove were essential for deaminase 
activity, consistent with previous reports [36,38]. These residues 
are in a region of the protein that appear to contact ssDNA in the 
NMR structure [38,45]. The DNA binding characteristics of ASA 
mutated at these residues suggests that these residues participate in 
DNA binding. ASA with mutations that prevent DNA binding 
retained the ability to be packaged into HIV-1 virions (Data not 
shown). A3G packaging rerjuircs an interaction with the viral 
genomic RNA [7], suggesting that RNA and DNA binding is 
mediated by different domains of the ASA. These experiments 
further suggest that this groove in ASA is a DNA binding site, in 
agreement with the recent report of Mitra et al. [45] . 

ASA has been shown to exist primarily as a monomer 
[35,38,45,46]; however, by coimmunoprecipitation of epitope 
tagged ASA proteins from 293T cells and by size exclusion 
chromatography using E. co&-produced ASA we detect that a 
fraction of the protein was present as multimers. The detection of 
multimers by coimmunoprecipitation from mammalian cells 
suggests that they are present in vivo. Whether dimers might have 
altered biological properties is not clear. Both forms were 
catalytically active. Dimerization could allow for more processive 
deamination. It could also affect the ability of ASA to package in 
HIV-1 virions. Our results dilfered from previous reports with 
respect to the affinity we measured for ASA binding to ssDNA. We 
measured a 150 nM while other groups measured a much 
lower aflinity for ASA and ssDNA. We cannot explain this 
difference; however, ASA ssDNA binding is highly sensitive to pH 
[46] . It is also possible that the presence of ASA dimers in our 
preparations could have increased the binding affinity measured. 

Despite extensive efforts, we were unable to establish a 
mammalian cell line that stably expressed ASA by lentiviral vector 
transduction while control cell-lines that expressed catalytically 
inactive mutated ASA were readily estabUshed (not shown). The 
protein as produced in E. coli was highly active such that even 
when not induced, it introduced frequent mutations in the plasmid 
expression vector itself The potent deaminase activity of ASA may 
explain why its expression in mammals is restricted to nondividing 
myeloid cells and why it is induced only by specific stimuli. ASA 
inducibihty by type-I and type-II IFNs suggests it plays an 
important role in antiviral defense. It has been shown to restrict 
human T cell leukemia virus type 1 and Rous Sarcoma Virus 
[27,47] but it is unclear as to whether these are its natural targets. 
A precise definition of target site preference will be useful in 
identifying A3A-generat("d mutations in pathogens. In addition, 
the ability to alter the target site preference of ASA may allow for 
tailoring the protein to alter its ability to restrict the replication of 
pathogens. 

Materials and Methods 

Cell Culture 

293T and HeLa cells were cultured in Dulbecco's modified 
Eagle medium supplemented with 10% fetal bovine serum and 
penicillin and streptomycin. Primary monocytes, monocyte- 
derived macrophages (MDM), monocyte-derived dendritic cells 
(MDDC) and T cells were cultured in RPMI 1640 supplemented 



with 10% FBS and penicillin and streptomycin. Monocytes were 
purified from leukocyte enriched blood samples obtained from 
anonymous donors by the New York Blood Center using anti- 
CD 14-conjugated magnetic beads (Miltenyi Biotec) and differen- 
tiated into MDM by culturing for 7 days with 50 ng/mL GM-CSF 
or into MDDC by culturing with 50 ng/mL GM-CSF and 
100 ng/mL IL-4 (Invitrogen). MDM and MDDCs were plated in 
12 well plates at 1.0x10^ cells per well and cultured for 20 h with 
or without 2000 units of Universal T)pe I IFN (PBL Biomedical 
Laboratories) or 2.0 |ig/ml y-lFN after which they were lysed in 
protease-supplemented lysis buffer (50 mM Tris pH 7.5, 150 mM 
NaCl, 2 mM EDTA and 1% NP40). 

Plasmids 

Mammalian expression vectors for FLAG-tagged, HA-tagged, 
and Myc-His tagged ASA were generated by PCR amplification of 
the coding sequence with oligonucleotide primers containing 
EcoR-I and Xho-I sites and ligated to pcDNA6 A/myc-his 
(Invitrogen). Amplicon lacking a Kozak sequence was cloned into 
pET42a+ to generate the pET42.A3A for expression of a GST- 
ASA fusion protein in E. coli. Point mutations were introduced into 
ASA by overlapping PCR. All plasmids were confirmed by 
nucleotide sequencing and expression of the encoded proteins was 
confirmed by immunoblot analysis. 

Recombinant ABA Expression and Purification 

E. coli BL21 (DE3) CodonPlus RIPL cells (Stratagene) were 
transformed with pET42a-ASA and single colonies were cultured 
overnight and used to seed 500 ml of Luria broth. The cultures 
were grown for 5 h at S7°C and induced for 2 h with 200 |J,M 
IPTG. The bacteria were pelleted, freeze thawed, resuspended in 
PBS, sonicated and lysed in buffer containing 1% Triton-XlOO. 
The lysate was treated for 1 h at rm. temp, with 25 units/ml of 
Benzonuclease (Novagen) and then clarified by centrifugation for 
10 min at 6000 xg. The recombinant protein was collected on 
500 |tl of glutathione agarose (Sigma) for 16 h at 4°C. The resin 
was washed with PBS and resuspended in buffer containing 
50 mM Tris-HCl (pH 8.0), 100 mM NaCl, 5 mM CaCla 
(pH 8.0). The protein was cleaved from the beads with 10 U 
Factor Xa (Novagen) and used for size exclusion chromatography 
or concentrated on YM-10 (Millipore) and stored at — 20°C in 
50% glycerol. rASA, without concentration, was analyzed by 
FPLC on a Superose-12 gel filtration column (GE) with protein 
standards. 1.0 ml fractions were collected and the protein 
concentration and deaminase activity was determined for each 
fraction. 

Mammalian A3A Expression 

293T cells (5.0x10^) were transfected with 4.0 ng pcDNA6- 
ASA-myc-his using Lipofectamine 2000 (Invitrogen). After 48 h, 
the cells were lysed in lysis buffer (50 mM Tris pH 8.0, 40 mM 
KCl, 50 mM NaCl, 5 mM EDTA, 0.1% Triton-XlOO, 10 mM 
DTT, protease inhibitors). To immunoprecipitate ASA, the lysates 
were precleared with protein-G sepharose (GE Healthcare) and 
immunoprecipitated with anti-myc antibody (Covance) and 
protein-G sepharose for 2 h at 4°C. Unbound protein was 
removed by washing and the beads were resuspended in 
deaminase buffer (40 mM Tris pH 8.0, 10% glycerol, 40 mM 
KCl, 50 mM NaCl, 5 mM EDTA, 5 ^M DTT). 

In vitro Deaminase Assay 

E. coli expressed ASA (1.0 |a,g), FPLC fractions (10 jxl) or 
mammalian expressed ASA bound on sepharose beads (5 jtl) was 
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incubated wdth 10 pmoles of 5 '-bio tin-labeled oligonucleotide in 
deaminase buffer for 60 min (E. coli ASA), or 1-6 h (mammalian 
ASA) at 37°C. The ASA was inactivated for S min at 95°C and 
then treated for 30 min at 37°C with I unit UDG in buffer 
containing 20 mM Tris pH 8.0, I mM DTT, 1 mM EDTA. The 
product was hydrolyzed for SO min in 140 mM NaOH at 37°C. 
The resulting oligonucleotide fragments were separated on a 15% 
TBE-urea PAGE gel and transferred to a nylon filter (Invitrogen). 
The m(^mbranes were UV rrosslinked, blocked with 5% milk and 
inc:ub;itc'd for 30 min. with 1.0 [Ig of DyLight 800 conjugated 
Streptavidin (Pierce). Signals were quantified on an Odyssey 
Imager (LI-COR). The percent deaminaUon was defined as the 
amount of cleaved oligonucleotide divided by the sum of the 
cleaved and uncleaved oligonucleotides. 

Substrates 

Most of the deaminase assays used a biotinylated ohgonucleo- 
tide with a single TCA site (5'-biotin-T28(TCA)T28-S') (Integrated 
DNA Technologies). To determine the deamination site prefer- 
ence of rASA, a set of biotinylated oligos containing all possible 
three-base combinations were used (5'-biotin-Ti4(ACA)T7(ACT)- 
T9(ACG)Ti7(ACC)T3i-3', 5'-biotin-Ti4(TCA)T7(TCT)T9(TCG) 
Ti7(TCC)T3i-3', 5'-biotin-Ti4(CCA)T7(CCT)T9(CCG)Ti7(CCC) 
T31-S', 5'-biotin-Ti4(GCA)T7(GCT)T9(GCG)Ti7(GCC)T3i-3'). 
Trinucleotide deaminase specificity was determined using bioti- 
nylated oligos with each possible trinucleotide context: NGN sites 
(5 '-biotin-T2«(NCN)T28-3 '). 

Deamination Sequencing 

An oligonucleotide containing repeating rA3A preferred deam- 
ination sites (5'-biotin-GGG GGT AGA TTG AGG TA-\ OTA 
(TCA)i8TGA ATA GGA GTG TTA AGG GGG-3') was treated 
with 50 ng rA3A in deaminase assay buffer. The deaminated 
DNA was cloned into TOPO TA (Invitrogen) and sequenced. 

Anti-A3A Antiserum Production and Immunoblot 
Analysis 

Anti-A3A antiserum was raised at Pocono Rabbit Farm & 
Laborators' (PRF&L) by repeated immunization of 2 rabbits with 
E. co/j-produced rA3A from which the GST had been removed. 
Protein lysates (30 ng) were separated on a 4-12% SDS PAGE 
gradient gel and then transferred to a PVDF membrane. The 
membrane was blocked in 5% milk and incubated with 1:1000 
anti-ASA serum diluted, 1:5000 anti-a-tubuUn monoclonal anti- 
body B-5-1-2 (Sigma) or 1:1000 anti-Myc monoclonal antibody 
9E10 (Covance). Bound antibody was detei:ted 1)y In'bridizing to 
1:1000 anti-rabbit or anti-mouse biotin-conjugated secondary 
antibody followed by streptavidin conjugated DyLight 800 
(Thermo Scientific) and visualized on an Odyssey Infrared 
Imaging System (LI-COR Biosciences). 

DNA Binding Assays 

DNA binding by electrophoretic mobility shift assay was 
determined by incubating 0.5 pmol of 5'-fluorescein-conjugated 
(T)8CG(T)6 DNA oligonucleotide at 4°C witii 2 Hg rASA in 10 jxl 
deaminase buffer for 1 h. The samples were separated by 6% 
native PAGE and visualized on a Typhoon Trio imager (GE 
Healthcare). DNA binding was quantified by measuring the 
intrinsic fluorescence of 225 nM rASA with increasing amounts of 
oligodeoxynucleotide in 25 mM Tris pH 8.0 and 25 mM NaCl on 
a FluoroMax-4 spectrofluorimeter [28,36]. The samples were 
excited at 295 nm and emission was detected at 345-355 nm. 
The extrinsic fluorescence of a fluorescein-conjugated ssDNA 



oligodeoxynucleotide with an increasing concentration of rASA 
was measured in triplicate on a Perkin Ekner Envision plate 
reader. The samples were excited at 485 nm and emission was 
detected at 535 nm. The change in intrinsic and extrinsic 
fluorescence was plotted against the concentration of the unlabeled 
oligodeoxynucleotide and rA3A respectively using SigmaPlot 
software. A one-site saturation ligand-binding curve was fitted 
for each plot and the was determined based on the ligand- 
binding curve. 

ASA and A3G Structure Analysis 

ASA NMR structure (2M65, [38]) and ASG catalytic domain 
crystal structure (SEIU, [S9]) were analyzed using PyMOL 
software. 

Ethics Statement 

Anti-ASA antiserum was prepared by Pocono Rabbit Farm and 
Laboratories (PRF&L) under the protocol PRF2A approved by 
PRF&L Institutional Animal Care and Use Committee (lACUC). 
PRF&L is accredited by the Association for Assessment and 
Accreditation of Laboratory Animal Care (AAALAC) and by the 
National Institutes of Health (NIH) Office of Laboratory Animal 
Welfare (OLAW), assurance number AS886-01 expiration January 
31, 2017. 

Supporting Information 

Figure SI rA3A monomer and dimer fraction are 
catalytically active. Deaminase activity of 1.0 |J,g ASA protein 
extract or 1 0 |tl of each size exclusion chromatography fraction 
was determined by incubation with an ohgonucleotide containing 
a TCA consensus target sequence. The results are representative 
of two independent repetitions using different batches of rASA. 
(TIF) 

Figure S2 Comparative ASA and ASG structure. A) 

Alignment of ASA and the carboxy terminal catalytic domain of 
ASG primary sequence against ASG secondary structure. The 
sequences were aligned using ClustalW2 (www.ebi.ac.uk/Tools/ 
msa/clustalw2/). Identical amino acids are in white on a red 
background. ASG secondary structure was extracted from its 
crystal structure. The a-helices are represented above their 
corresponding primary sequence by a drawn helix, while arrows 
represent fi-sheets. Each residue mutated in ASA is indicated with 
a triangle containing the matched color from the 3D model of 
Figures 5 and 6. A black bar below the sequence delimits RLl and 
2. B) A 3D model of ASA displaying the tertiary structure is 
displayed. For each amino acid mutated or swapped loop, an 
arrow indicates its position. 
(TIF) 

Figure S3 The rabbit anti-ASA antibody does not cross- 
react with other members of the APOBEC3 family. 29ST 

were transfected with an empty pcDNAO vector (EV) or vectors 
encoding HA-tagged ASA, APOBEC3B, APOBEC3C, ASF and 
ASG. Lysates were separated by SDS-PAGE, and blotted with 
mouse anti-HA antibody, to control for protein expression, or a 
rabbit antiserum raised against rASA. 
(TIF) 

Figure S4 W104A is catalytically inactive. Wild-type and 
mutated ASA were expressed in transfected 29ST cells, immuno- 
precipitated from cell lysates and then tested for deaminase activity 

against an oligonucleotide containing a TCG target sequence. The 

activity was measured at the indicated time points. 

(TIF) 
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